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There is growing consensus tliat intervention and treatment of Huntington disease (HD) 
should occur at the earliest stage possible. Various early-intervention methods for this fatal 
neurodegenerative disease have been identified, but preventive clinical trials for HD are 
limited by a lack of knowledge of the natural history of the disease and a dearth of appro- 
phate outcome measures. Objectives of the current study are to document the natural 
history of premanifest HD progression in the largest cohort ever studied and to develop 
a battery of imaging and clinical markers of premanifest HD progression that can be used 
as outcome measures in preventive clinical trials. Neurobiological predictors of Hunting- 
ton's disease is a 32-site, international, observational study of premanifest HD, with annual 
examination of 1013 participants with premanifest HD and 301 gene-expansion negative 
controls between 2001 and 2012. Findings document 39 variables representing imaging, 
motor, cognitive, functional, and psychiatric domains, showing different rates of decline 
between premanifest HD and controls. Required sample size and models of premanifest 
HD are presented to inform future design of clinical and preclinical research. Preventive 
clinical trials in premanifest HD with participants who have a medium or high probability of 
motor onset are calculated to be as resource-effective as those conducted in diagnosed 
HD and could interrupt disease 7-1 2 years earlier. Methods and measures for preventive 
clinical trials in premanifest HD more than a dozen years from motor onset are also fea- 
sible. These findings represent the most thorough documentation of a clinical battery for 
experimental therapeutics in stages of premanifest HD, the time period for which effective 
intervention may provide the most positive possible outcome for patients and their families 
affected by this devastating disease. 

Keywords: Huntington disease, neurodegenerative disorders, premanifest, natural hiistory, clinical trials, outcome 
measures, PREDICT-HD 



INTRODUCTION 

Since 2001, Neurobiological Predictors of Huntington's Disease 
(PREDICT-HD; NS040068) has examined early indicators of 



disease in over 1300 participants at risk for Huntington disease 
(HD). Previous publications documented motor, cognitive, psy- 
chiatric, and imaging correlates of emerging disease (Paulsen et al. 



Frontiers in Aging Neuroscience 



www.frontiersin.org 



April 2014 I Volume 6 | Article 78 | 1 



Paulsen et al. 



Premanifest HD and trial feasibility 



2006, 2008; Duff et al., 2007; Biglan et al, 2009). The discovery 
that changes due to HD begin many years prior to the onset of 
diagnosable HD has fostered growing consensus that intervention 
at the earliest possible phase is desirable. Experimental pharma- 
cologic interventions are currently being tested and methods to 
silence the polyglutamine gene expansion are underway (Zhang 
et al, 2009; Ross and Tabrizi, 2011). Significant limitations to pre- 
ventive clinical trials for HD include a lack of knowledge of the 
natural history of the disease and a dearth of outcome measures 
sensitive to disease changes in the earliest, or premanifest, stages 
of the disease. 

To date, longitudinal studies have documented changes for 
12, 24, and 36 months in smaller samples of premanifest HD 
(Tabrizi et al., 2013). Changes have been demonstrated in basal 
ganglia brain volumes (Aylward et al.,2011; Majid et al, 2011) and 
declines in cognitive performance, primarily emphasizing speeded 
cognitive processing (Kirkwood et al., 1999; Paulsen et al, 2001, 
2013; Lemiere et al., 2002; Maroof et al., 201 1), executive function 
(Snowden et al, 2002; Rupp et al, 2010; O'Rourke et al., 2011; Papp 
et al, 2013), working memory (Lemiere et al., 2004), visuomotor 
control (Hart et al., 2011; Tabrizi et al., 2012, 2013), and time pro- 
duction (Rowe et al, 2010). To our knowledge, only one other 
study has examined multiple phenotypic and biologic markers of 
change over time together in premanifest HD (Tabrizi et al., 2012, 
2013). Conclusions from this group suggest imaging measures are 
useful for indexing change, but "decline in cognitive, quantitative 
motor, or oculomotor measures were limited at this stage" (Tabrizi 
etal, 2012). 

The aims of the study are twofold. First, we aim to document the 
natural history of premanifest HD progression in the largest cohort 
ever studied. Second, we aim to develop a battery of imaging and 
clinical markers of premanifest disease progression. Findings are 
interpreted in terms of their utilitarian value for preventive clini- 
cal trials. Outcome measures are scaled to facilitate comparisons, 
and hypothesized effect sizes are used to determine sample sizes 
for randomized clinical trials (RCTs). Finally, graphical analysis is 
used to illustrate the course of premanifest HD. 

MATERIALS AND METHODS 
PARTICIPANTS 

Data in this study were collected from September 2001 to August 
2012 from N = 1314 PREDICT-HD participants (1013 with pre- 
manifest HD and 301 controls) at 32 worldwide sites. All par- 
ticipants had completed genetic testing for HD prior to (and 
independent from) study enrollment. Participants with >35 CAG 
expansion repeats in the HTT gene were cases and those with 
repeats <36 served as gene-mutation negative comparison partic- 
ipants (controls). Exclusion criteria included other central nervous 
system disease, injury, or developmental disorder, or evidence of 
an unstable medical or psychiatric illness. The research protocol 
was approved by each site's respective institutional review board 
and ethics committee, and all participants gave written informed 
consent and were treated in accordance with ethical standards. 

Average years in the study (median) were six, with a range of 
1-10. Over 75% of the sample had more than 3 years of data col- 
lected, 15% had 2 years, and < 10% had 1 year. A subset of N= 204 
gene-expanded participants received a motor diagnosis during the 



study (referred to as "converters"). Dropout was less than 5% 
per year. Sample size variation was due to a number of historic 
study design events: 1) the National Institutes of Health (NIH) 
grant that funded the study was renewed three times and partici- 
pants were recruited for the duration of each individual grant; and 
2) grant reviewers increased sample size on each renewal, so the 
total sample size increased as the length of possible study duration 
decreased. 

PREMANIFEST STAGING GROUPS: CAG-AGE PRODUCT 

Premanifest stages were based on a formula using genetic and 
demographic information to estimate proximity to HD diagnosis. 
The CAG-Age Product (CAP) score, computed as CAPe = (age 
at entry) x (CAG — 33.66) (Zhang et al, 201 1), was derived from 
an accelerated failure time (AFT) model predicting motor diag- 
nosis from age at entry, CAG length, and their interaction. CAPe 
is similar to the "disease burden" score of Penney et al. (1997) 
and presumably indexes the cumulative toxicity of mutant hunt- 
ingtin. CAPe can also be used to estimate the 5-year probability 
of motor diagnosis. Cutoffs for groups were CAPe < 290 (Low), 
290 < CAPe < 368 (Medium), and CAPe > 368 (High). The esti- 
mated time to diagnosis was, respectively, > 12.78, 7.59-12.78, and 
<7. 59 years. A dynamic (time-varying) CAP score was also used, 
denoted as CAPd, and computed with current age (rather than 
age at entry). CAPd can be interpreted as a type of CAG-adjusted 
age metric (Ross et al., 2014). Table 1 shows descriptive statistics 
for demographic variables by premanifest groups defined at study 
entry (i.e., based on CAPe). 

STATISTICAL ANALYSIS 

The main analysis focused on change over time in each premanifest 
group controlling for covariates (age, education, gender, depressed 
mood severity, brain scanner field strength). Interest was in the 
comparison of premanifest and control groups. Using linear mixed 
effects regression (LMER) (Verbeke and Molenberghs, 2000), 39 
variables of interest were analyzed separately. Detailed descrip- 
tions of the variables are provided in the Supplementary Material. 
To control for site-to-site variability, a three-level model was used 
with repeated measures nested within participants, nested within 



Table 1 | Demographic variables for progression groups. 





Control" 


Low 


Medium 


High 


N 


301 


283 


358 


372 


Female 


194 (64.45%) 


190 (6714%) 


235 (65.64%) 


218 (58.60%) 


Age 


44.36 (11.41) 


34.98 (792) 


41.67 (9.56) 


44.93 (10.09) 


Education 


14.87 (2.56) 


14.57 (2.44) 


14.54 (2.61) 


14.33 (275) 


Years in 


4.39 (2.29) 


4.41 (2.68) 


4.64 (2.61) 


4.98 (2.52) 


study 










CAG 


20.27 (3.49) 


40.91 (1.62) 


42.02 (2.04) 


43.58 (2.74) 


CAPe 


NA 


243.97 (34.55) 


330.50 (23.05) 


423.19 (51.25) 


Cornverters 


NA 


11 (3.89%) 


49 (13.69%) 


144 (38.71%) 



' Percentage is based on group total. CAG, cytosine-adenine-guanine expansion; 
CAPe= (age at entry) x (CAG— 33.66). Mean (SD) is presented for a continuous 
variable and frequency (percentage) for a categorical or count variable. 
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sites. A preliminary analysis not presented showed evidence that 
linear curves were adequate for the modeling of change over time 
stratifying on CAPe group. Random intercepts and slopes were 
specified for participants, as well as for sites. The time metric for 
the analysis was duration in the study (years in the study) with 
0 = study entry. Two models were estimated for each outcome 
variable, a null model of duration and covariates only, and a full 
model adding CAPe group intercept and slope differences. Max- 
imum likelihood (ML) methods were used for estimation, which 
yield unbiased estimates under the widely- applicable assumption 
that the missing data mechanism is ignorable (Little and Rubin, 
2002). The two models (null, full) were compared using the like- 
lihood ratio test (LRT), which evaluates the null hypothesis that 
two nested models are statistically equivalent. The LRT statistic 
can be treated as an effect size measure in this case because the 
degrees of freedom are constant for each model test, and the out- 
come variables were rank-ordered according to the LRT statistic. 
To facilitate comparisons among the outcome variables, the esti- 
mated slopes from the LMER analysis were expressed in standard 
deviation (SD) units. To produce these estimates, each outcome 
was scaled using the grand mean and SD prior to the LMER analy- 
sis. Interest was in the comparison of premanifest and control 
groups. Z-tests of slope differences were computed as the esti- 
mated difference divided by its standard error. The control group 
Z-test was a test against a zero slope value. Three ancillary analy- 
ses were conducted. The first analysis examined possible effects 
of conversion. The second focused on required sample size for a 
hypothetical RCT The third was a graphical analysis of trends over 
all progression periods for eight of the key variables. Variables were 
chosen conceptually to represent the phenotypic characteristics of 
HD (motor, cognitive, psychiatric) as well as to represent biologi- 
cal (imaging) and functional outcomes. Details of all analyses are 
presented in the Supplementary Material. 

RESULTS 

Table 2 shows the LMER results. The variables with the largest 
effect sizes (LRT statistics) were imaging measures based on 
regional brain volumes (corrected for intra-cranial volume and 
controlled for change in field strength). The two top-ranked mea- 
sures were the putamen and caudate structure volumes. The slopes 
for the controls showed significant decrease over time (consis- 
tent with normal aging), but the decline in the gene-expansion 
groups steadily decreased over time for all groups. The slope for 
each premanifest group was statistically different from the Con- 
trol group (allps < 0.001). Other imaging variables demonstrating 
significant change relative to controls included accumbens, cere- 
bral spinal fluid (CSF), lobar gray, hippocampus, and lobar white 
(though only for the High group) . The putamen, caudate, CSF, and 
lobar gray measures showed significant longitudinal change in all 
three premanifest groups. A graphical depiction of change in brain 
volume for the groups is shown in Figure SI in Supplementary 
Material. 

Total motor score (TMS) from the Unified Huntington's Dis- 
ease Rating Scale (UHDRS) showed the next highest effect size 
in change rates over time (rank four). The Control group slope 
was not statistically different from zero, and the Low group slope 
was not statistically different from the Control group slope. The 



Medium group slope was significantly larger than the Control 
slope, as was the High slope (all ps < 0.001). The next strongest 
effects were for bradykinesia (rank five) and chorea (rank six). 

DecHne in cognitive performance was significant in every mea- 
sure examined. Symbol Digit Modality Test (SDMT) had the 
seventh-strongest effect over all measures. The Control group 
slope was positive and significantly greater than zero, indicating 
a practice effect. The Low group slope was negative and worse 
than the Control group slope. The Medium slope showed greater 
decline than the Low slope, and the High slope showed even 
greater decline. All cognitive measures examined showed signif- 
icant changes in the High group, and 9 of 10 cognitive measures 
showed significant change in the Medium group. Four cognitive 
measures (SDMT, Stroop-word, SmeU-ID, and the Trail Making 
Test) showed significant change over time in the Low group. 

Regarding the functional variables, every measure examined 
showed significant change over time compared with the controls, 
except for the participant-rated World Health Organization Dis- 
ability Assessment Schedule (WHODAS). The total functional 
capacity (TEC) scale (rank 20) showed the largest effect in the 
High group. The Control group slope and the Low group slope 
were not statistically different than zero. The Medium group slope 
showed significant decline, and the High group slope even more so. 
Though the Everyday Cognition Rating Scale (ECog) companion 
total had a weaker effect size (rank 26), the Low group was statisti- 
cally different than the Control group, as were the higher progres- 
sion groups. Four of the six functional measures showed robust 
change in the Medium group and three showed change in the Low 
group. The functional measures most appropriate for the different 
stages of premanifest HD varied. Whereas the TEC showed the 
greatest effect size in the High group, the companion WHODAS 
and ECog showed greater change in the Medium and Low groups. 

Eight of the nine psychiatric variables showed significant 
change over time relative to controls. The SCL-90 obsessive- 
compulsive scale had the strongest effect (rank 22). There was 
an increase in obsessive-compulsive signs over time as the pro- 
gression group increased (Control through High). Frontal Sys- 
tems Behavioral Scale (ERSBE) executive and apathy subscales, 
ranked 24 and 25, respectively, also showed robust effect sizes. 
Seven of nine measures showed significant change in the Medium 
group and one psychiatric measure (ERSBE disinhibition) showed 
significant longitudinal change in the Low group. 

Table 3 shows the six variables that had a statistically significant 
acceleration of the slope for the participants who converted. The 
fourth column (acceleration) shows the value added to a slope in 
Table 2 to indicate the additional deterioration associated with 
conversion. The variables are sorted by proportionate increase, 
with dystonia having the largest change under conversion (approx- 
imately 2.1 times faster decline). The acceleration factor is most 
applicable for the High group because the largest proportion of 
conversion occurred in this group (see Table 1). The acceleration 
factors ranged from a mild added slope acceleration of 0.06 SD 
per year for TMS, to a strong acceleration of 0.20 SD per year for 
dystonia. 

Table 4 shows the single-group estimated sample size as a func- 
tion of dropout percentage, effect size, and estimated parameters 
for a hypothetical RCT of efficacy consistent with guidelines for 
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Table 2 | Linear mixed effects regression results sliowing annual rate of change (slope) and rank-order based on effect size. 



Variable Type N N* Progression group LRT Rank 











Control 


Low 


Medium 


High 






Putamen 


Imaging 


1206 


2845 


-0.0407*** 


-0.0900*** 


-0.1122*** 


-0.1119*** 


714.38*** 


1 


Caudate 


Imaging 


1207 


2846 


-0.0331** 


-0.0769*** 


-0.1028*** 


-0.1192*** 


515.33*** 


2 


Accumbens 


Imaging 


1207 


2846 


-0.0264 


-0.0279 


-0.0853*** 


-0.1110*** 


417.38*** 


3 


CSF 


Imaging 


1205 


2840 


0.0511*** 


0.0804** 


0.1021*** 


0.0978*** 


223.19*** 


11 


Lobar gray 


Imaging 


1175 


2768 


-0.0874* 


-0.1220** 


-0.1293*** 


-0.1523*** 


125.23*** 


19 


Hippocannpus 


Imaging 


1206 


2845 


-0.0337** 


-0.0275 


-0.0626** 


-0.0910*** 


98.33*** 


23 


Lobar white 


Imaging 


1149 


2680 


0.0223 


0.021 


0.0122 


-0.0220*** 


63.73*** 


29 


Thalamus 


Imaging 


1204 


2843 


-0.0327* 


-0.0217 


-0.0362 


-0.0487 


18.69** 


39 


TMS 


Motor 


1308 


6077 


0.0016 


0.0332 


0.0933*** 


0.2397*** 


415.75*** 


4 


Chorea 


Motor 


1310 


6124 


0.0028 


0.0395 


0.0935*** 


0.2430*** 


332.72*** 


5 


Brady 


Motor 


1308 


6102 


-0.003 


0.0297 


0.0790*** 


0.2063*** 


331.14*** 


6 


Ocular 


Motor 


1309 


6124 


0.0008 


0.0097 


0.0683*** 


0.1397*** 


293.61*** 


8 


Dystonia 


Motor 


1310 


6130 


-0.0009 


0.0295 


0.0524* 


0.1698*** 


114.56*** 


21 


Rigidity 


Motor 


1310 


6127 


0.0203 


0.0406 


0.028 


0.0744*** 


38.58** 


36 


SDMT 


Cognitive 


1230 


5252 


0.0287** 


-0.0012** 


-0.0336*** 


-0.0800*** 


303.63*** 


7 


riwQ r h\/+ h m i a 
L.'yoi 1 ly LI u 1 iici 


v^tjy 1 11 LIVB 


969 


2270 


—0 0107 


— n nnsQ 

— U.UUO\7 


n n4RR* 

VJ.VJ^UO 


0.1279*** 


236.60*** 


g 


Stroop-Co 


Cognitive 


1228 


5246 


0.0327*** 


0.0103 


-0.0284* •* 


-0.0811*** 


234.50*** 


10 


Stroop-Wo 


Cognitive 


1228 


5255 


0.0056 


-0.0201* 


-0.0416*** 


-0.0986*** 


222.97*** 


12 


Sp-Tapping 


Cognitive 


975 


2278 


0.0046 


0.038 


0.0629** 


0.1476*** 


214.97*** 


13 


Smell-ID 


Cognitive 


1212 


3745 


0.0004 


-0.0372* 


-0.0559*** 


-0.1283*** 


189.31*** 


14 


TMT-B 


Cognitive 


1221 


3875 


-0.02 


0.0112 


0.0188* 


0.0878*** 


178.52*** 


15 


Stroop-ln 


Cognitive 


1228 


5245 


0.0452*** 


0.0252 


0.0068** 


-0.0468*** 


172.69*** 


16 


EmoRec 


Cognitive 


978 


2300 


0.0342 


0.049 


0.0466 


-0.0302** 


163.05*** 


17 


TMT-A 


Cognitive 


1223 


3905 


-0.0385* 


0.0024* 


0.0199** 


0.1048*** 


141.71*** 


18 


TFC 


Functional 


1308 


6140 


-0.0184 


-0.0443 


-0.0870*** 


-0.2093*** 


124.03*** 


20 


ECog-C 


Functional 


795 


1599 


-0.0172 


0.0469* 


0.0576** 


0.1074*** 


68.24*** 


26 


ECog-P 


Functional 


899 


1928 


-0.0228 


0.0292* 


0.0117 


0.0408** 


50.05** 


32 


FAS 


Functional 


1052 


4160 


-0.0071 


-0.0417 


-0.0692* 


-0.1688*** 


43.20*** 


35 


WHODAS-C 


Functional 


708 


1254 


0.0105 


0.0597 


0.0859* 


0.1193*** 


37.56** 


37 


WHODAS-P 


Functional 


769 


1499 


0.0199 


0.0691 


0.0522 


0.0328 


33.04** 


38 


S-OC-C 


Psychiatric 


1236 


5392 


-0.0293* 


0.0027 


0.0175** 


0.0649*** 


100.54*** 


22 


F-Exc-C 


Psychiatric 


1235 


5288 


-0.0226 


-0.0036 


0.0288** 


0.0705*** 


97.95*** 


24 


F-Apa-C 


Psychiatric 


1235 


5288 


-0.0158 


0.0131 


0.0346** 


0.0706*** 


80.71*** 


25 


S-GSI-C 


Psychiatric 


1237 


5395 


-0.0398*** 


-0.0096 


-0.0016* 


0.0404*** 


66.53*** 


27 


S-Dep-C 


Psychiatric 


1236 


5392 


-0.0391** 


-0.0054 


-0.0048* 


0.0352*** 


65.15*** 


28 


S-Anx-C 


Psychiatric 


1237 


5393 


-0.0339** 


-0.01 


-0.0031* 


0.0345*** 


53.14*** 


30 


BDI 


Psychiatric 


1042 


4120 


-0.0063 


-0.0044 


-0.04 


0.0191 


52.88** 


31 


F-Dis-C 


Psychiatric 


1235 


5288 


-0.0219 


0.0137* 


0.0295*** 


0.0575*** 


49.57*** 


33 


S-Hos-C 


Psychiatric 


1236 


5392 


-0.0256* 


-0.0208 


-0.0063 


0.0166** 


46.45** 


34 



Imaging variables adjusted for baseline Intra-cranial volume. N, sample size; W* number of data points; LRT, likelihood ratio test statistic; *p< 0.05, **p< 0.01, 
***p < 0.001. Control p-value is a test against a zero slope and a CAP group p-value Is a test against the Control slope. Progression Group (CAP group) is defined at 
study entry (CAP,,). Progression Group columns show the standardized annual rate of change (slope). Rank (last column) is based on the likelihood ratio test statistic. 
CSf: cerebral spinal fluid; TMS, total motor score; Brady, bradykinesia; SDMT Symbol Digit Modalities Test; Stroop-Co, Stroop Color and Word Test - color condition; 
Stroop-Wo, Stroop Color and Word Test - word condition; SP-Tapping, speeded tapping; Smell-ID, University of Pennsylvania Smell Identification Test (UPSIT); TMT-B, 
Trail Making Test, Part B; Stroop-ln, Stroop Color and Word Test - interference condition; EmoRec, emotion recognition test; TMT-A, Trail Making Test, Part A; TFC, 
total functional capacity; ECog-C, Everyday Cognition Rating Scale - Companion Rating Scale; ECog-P Everyday Cognition Rating Scale - Participant Rating Scale; 
FAS, functional activity scale; WHODAS-C, World Health Organization Disability Assessment Schedule - companion rating scale; WHODAS-P World Health Orga- 
nization Disability Assessment Schedule - participant rating scale; S-OC-C, Symptom Checklist 90-obsessive-compulsive scale - companion rating scale; F-Exc-C, 
Frontal Systems Behavioral Scale - executive subscale - companion rating scale; F-Apa-C, Frontal Systems Behavioral Scale - apathy subscale - companion rating 
scale; S-GSI-C, Symptom Checklist 90 - Global Severity Index - companion rating scale; S-Dep-C, Symptom Checklist 90 - depression subscale - companion rating 
scale; S-Anx-C, Symptom Checklist 90 - anxiety subscale - companion rating scale; BDI, Beck Depression Inventory-ll; F-Dis-C, Frontal Systems Behavioral Rating 
Scale - disinhibition subscale - companion rating scale; S-Hos-C, Symptom Checklist 90 - hostility subscale - companion rating scale. 



Frontiers in Aging Neuroscience 



www.frontiersin.org 



April 2014 I Volume 6 | Article 78 | 4 



Paulsen et a 



Premanifest HD and trial feasibility 



Table 3 | Slope acceleration effect of motor diagnosis. 



Variable 


Type 


Chisq 


Acceleration 


Increase 


Dystonia 


Motor 


91.77*** 


0.191182 


2.125979 


Stroop-Co 


Cognitive 


42.02*** 


-0.07891 


1.973548 


FAS 


Functional 


26.80*** 


-0.15594 


1.923999 


SDMT 


Cognitive 


29.57*** 


-0.06183 


1.773312 


TFC 


Functional 


29.49*** 


-0.08748 


1.417992 


TMS 


Motor 


22.20*** 


0.055938 


1.233343 



*p<0.05, "p<0.01, ***p< 0.007. Stroop-Co, Stroop Color and Word Test - 
color condition: FAS, functional activity scale; SDMT, Symbol Digit Modalities Test; 
TFC, total functional capacity: TMS, total motor score. 



Phase II trials (The Lancet Neurology, 2012). Listed in the effect 
size columns (20%, etc.) are the estimated required sample size. 
Results are listed for variables that required N/2 < 3000 for a 20% 
effect and 20% dropout. CSF had the smallest single-group sample 
size (e.g., N/2 = 27 for a 70% effect with no dropout), followed 
by putamen, caudate, TMS, speeded tapping, and additional vari- 
ables. An approximate 70% difference in slopes was obtained in 
a recent clinical efficacy trial for HD (Huntington Study Group, 
2006). Sample sizes do not take into account change associated 
with normal aging and thus may overestimate the effect sizes 
for differences between treatment versus placebo groups for any 
treatment that only addresses disease-related change. 

Figure 1 shows curves of individual participants (thin colored 
lines) and cubic spline curves (thick black lines) by premanifest 
stage. For some of the variables, change was slower for earlier pre- 
manifest stages but accelerated with proximity to average motor 
onset value (vertical line) with a greater acceleration thereafter. 

Figure 2 shows a model of disease progression for eight vari- 
ables throughout the course of premanifest HD. The curves are 
based on a cubic spline fit after standardizing each variable rela- 
tive to controls. The imaging, cognitive, and psychiatric variables 
showed linear increase over all premanifest stages, whereas motor 
and functional variables tended to show a non-linear trajectory 
with a sudden acceleration just prior to motor onset. Mean years 
in the study were 6, with a range of 1-10. Over 75% of the sample 
had >3 years of data, 15% had 2 years, and <10% had 1 year. A 
subset of N = 204 gene-expanded participants received a motor 
diagnosis during the study, referred to as "converters." Dropout 
was less than 5% per year. Sample size variation was due to a num- 
ber of historic study design events: 1 ) the NIH grant that funded 
the study was renewed three times and participants were recruited 
for the duration of each individual grant; and 2) grant reviewers 
increased sample size on each renewal, so the total sample size 
increased as the length of possible study duration decreased. 

DISCUSSION 

Findings show longitudinal change in 36 of 39 measures exam- 
ined over a 10-year natural observation study in premanifest HD. 
Effect sizes suggest a preventive RCT could be efficiently designed 
to detect treatment effects in the neighborhood of 30%, and effects 
similar to the recent tetrabenazine efficacy trial (about 70% effect) 
might be found with sample sizes near to 30 per arm, depend- 
ing on the amount of dropout (Huntington Study Group, 2006). 



Significant measures include each of the clinical phenotypic char- 
acteristics of HD (motor, cognitive, psychiatric), as well as biologic 
and functional outcomes. No previous study has so thoroughly 
documented a clinical battery for experimental therapeutics in 
stages of premanifest HD. Current findings dovetail with those 
generated from smaller studies. Most importantly, the specific 
measure chosen for each of the primary components typically 
measured in HD is dependent upon the disease stage of the pre- 
manifest cohort targeted for intervention. For example, the best 
cognitive variable for a clinical trial in persons who are < 8 years 
to motor onset (High group) is tapping speed because it had the 
largest significant absolute value standardized slope (=0.1476). 
On the other hand, the most robust measure for tracking cogni- 
tive change in persons who are more than a dozen years to onset 
(Low group) is smell identification (see Table 2). 

It is important to note that actual design of clinical trials 
involves many factors. Trial design will need to balance feasi- 
bility of cost-effective, multi-site research with the expense of 
advanced technology methods that may limit resources and result 
in fewer treatments being evaluated (Ross and Tabrizi, 2011). Ide- 
ally, clinical trials for HD will involve intervention at multiple 
and differing points along the cascade of changes that are known 
to occur from gene expansion to patient suffering. Hence, inter- 
ference with the disease processes could involve gene silencing, 
altering posttranslational modifications of Huntington, ameliora- 
tion of gene transcription abnormalities, or buttressing metabolic 
abnormalities to thwart the devastation that people with HD and 
their families endure. Different outcome measures may also have 
different kinds of utility for Phase I versus Phase II trials or symp- 
tomatic versus disease-modifying strategies. It is important to keep 
in mind that the measurements listed in Table 4 do not take into 
account changes related to normal aging. Changes in CSF vol- 
ume, for example, are significant for premanifest participants but 
also demonstrate relatively large effect sizes for controls. Thus, a 
treatment that targets disease-related change, but not age-related 
change, may be better evaluated with measures that are more sen- 
sitive to longitudinal differences between premanifest cases versus 
controls. 

Much recent attention has been devoted to the importance of 
natural history studies in the design of clinical trials^. In HD 
literature, several authors have developed conceptual models of 
natural history of the disease. None of the models are based on 
actual data, however (see text footnote 1 ) . Findings from this study 
were used to develop a natural history model that is based on up 
to 10 years of natural observation spanning a substantial range 
of the premanifest period. The model can be used to develop 
progression-based care guidelines as well as to design clinical 
trials. Importantly, these data can provide external historical con- 
trols facilitating single-treatment-group or dose-selection trials 
without an active randomized placebo group (Elm et al., 2005; 
Czaplinski et al., 2006). Figure 2 provides a data-driven model 



^Workshop on Natural History Studies of Rare Diseases: meeting the Needs of 
Drug Development and Research." [U.S. Department of Health & Human Services, 
National Institutes of Health and U.S. Food and Drug Administration workshop, 
Bethesda, MD, USA May 16-17, 2012, https://events-support.com/events/Natural_ 
History_Studies/page/ 8 7 ] 
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Table 4 | Estimated required sample size (right side) for a two-arm pliase 11 randomized clinical trial. 



Variable Type Dropout (%) Estimated parameters Effect size 
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Effect size is the percentage difference in rate of cfiange (slope) of the treated and untreated groups (see Supplementary Material). CSF, cerebral spinal fluid; TMS, 
total motor score; SP-Tapping, speeded tapping; Brady, bradykinesia; TFC, total functional capacity; SDMT Symbol Digit l^odalities Test. 



of disease progression and illustrates progression using at least 
one variable from each clinical domain (motor, cognitive, psychi- 
atric), biological evidence of decline (imaging variables), as well as 
a functional variable to encompass regulatory requirements. For 
practical purposes we chose outcomes that illustrated the great- 
est change for the High group, although the best choice for any 
specific clinical trial will vary dependent upon the treatment, the 
target and the premanifest cohort. For instance, the finding that 



the TFC did not show change in the Low and Medium groups, 
coupled with the sample sizes necessary to show a drug effect on 
TFC in the High group suggests that this measure is not likely to 
be an appropriate candidate for premanifest intervention in early- 
phase trials (for Phase 3, the Food and Drug Administration [FDA] 
wants to see a functional outcome). Efforts to provide a clinically 
meaningful outcome (as requested by regulatory agencies) may 
require consideration of other markers and extensive discussions 
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FIGURE 1 I Plots of key outcome variables for preventive clinical 
trials. Empirical individual curves (thin lines) and fitted spline curves 
(thick lines) of key variables plotted over CAPo (CAG-corrected age). 
Dashed lines indicate individuals who converted in the study. TMS 



with regulators. Such findings have immediate implications for 
current design of RCTs. 

A primary strength of the PREDICT-HD study is that over 1 300 
gene-mutation-tested participants were prospectively followed up 
to, and, for some, through the point of actual motor diagnosis. 



indicates total motor score; TFC, total functional capacity; SDMX 
Symbol Digit Modalities Test; Smell-ID, University of Pennsylvania 
Smell Identification Test (UPSIT); Obsess-Comp, obsessive-compulsive; 
CS Fluid, cerebral spinal fluid. 



Such data can be effective to document phenotypic and biologic 
changes that occur in persons with the gene expansion over the 
decade prior to, and, just after the manifestation of disease. Find- 
ings suggest the course of biologic progression in premanifest HD 
appears linear for imaging, cognitive, and psychiatric data, but 
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FIGURE 2 I Data-derived model of premanifest Huntington disease. 

Scaled fitted spline curves of key variables plotted over CAPp (CAG-corrected 
age). Blue colors indicate variables that decrease and red colors indicate 
variables that increase. Vertical axis is in standard deviation (SD) units. TMS 



non-linear for motor and functional data (see Figure 2). Motor 
expression appears to accelerate as the disease manifests over the 
course of approximately 15 years prior to motor onset. It is impor- 
tant to note that increases in the number and severity of clinical 
outcomes reflect measurement aspects of the manifestation of dis- 
ease and do not necessarily reflect a curvilinear, or increased, rate 
of disease progression. One possible explanation maybe that atro- 
phy of each individual brain region (e.g., putamen as shown in 
Figures 1 and 2) proceeds relatively linearly, but as additional 
brain regions begin to undergo degeneration and dysfunction, 
their combined effect causes acceleration of the clinical expression 
of disease. 

Additional strengths include the large, worldwide collabora- 
tion among 32 sites and across brain scanners, cultures, and 
languages, involving multiple disciplines and specialties. The 
PREDICT-HD study may be most relevant to actual clinical tri- 
als where multiple sites are likely to be used to acquire sufficient 
sample sizes (The Lancet Neurology, 2012). Finally, the care 
with which the study was conducted provides quality control, 
quality assurances, standardization protocols, and statistical con- 
trol for many common confounds (age, gender, education) and 
not-so-common confounds (depressed mood, field strength). All 
components of the study are shared worldwide to assure findings 
can be replicated and utilized in the future as more knowledge 



indicates total motor score; CSF, cerebral spinal fluid; DYS, dysrhythmia; 0-C, 
obsessive-compulsive; SDMT, Symbol Digit Modalities Test; SMELL, 
University of Pennsylvania Smell Identification Test {UPSIT);TFC, tota 
functional capacity. 



is acquired about HD and other neurodegenerative disorders, 
so PREDICT-HD data can continue to facilitate progress for 
decades onward. 

Weaknesses also exist in the study. Interval to follow-up was 
1 year for clinical and cognitive assessment and 2 years for imag- 
ing, and clinical trials demand more frequent assessments. It is 
recommended that the measures proposed be subjected to a brief 
repeated measures study over a period of 6 months to assure that 
more rapid assessment can be documented. Finally, due to the 
length of the PREDICT-HD study, some protocol changes were 
unavoidable. A common-but-important variation in our study 
was the change in MRI scanners from 1.5 to 3 T. The basal gan- 
glia structures were processed to accommodate this change, and 
field strength was statistically controlled in the analysis. Solutions 
to varying field strength and other methodological challenges of 
PREDICT-HD can be made available to researchers who are inter- 
ested in natural history studies. The lobar white matter measures 
reported in this work incorporate data from much more heteroge- 
neous data collection than were reported in previous publications. 
The expanded subject inclusion for lobar white matter measures 
included both the 1.5 and 3 T scanners from a larger number 
of scanning sites. While the strength of white matter lobar mea- 
sures is less than previously reported, we believe this is due to 
inherent measurement variability introduced by heterogeneous 
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data collection. Systematically addressing the variability due to 
heterogeneous data collection is a focus of ongoing work. 
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